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Abstract 

Multiple-choice load balancing has been a topic of intense study since the seminal paper of 
Azar, Broder, Karlin, and Upfal. Questions in this area can be phrased in terms of orientations 
of a graph, or more generally a A:-uniform random hypergraph. A (d, b) -orientation is an assign- 
ment of each edge to d of its vertices, such that no vertex has more than b edges assigned to it. 
Conditions for the existence of such orientations have been completely documented except for 
the "extreme" case of (k — 1, l)-orientations. We consider this remaining case, and establish: 

• The density threshold below which an orientation exists with high probability, and above 
which it does not exist with high probability. 

• An algorithm for finding an orientation that runs in linear time with high probability, with 
explicit polynomial bounds on the failure probability. 

Previously, the only known algorithms for constructing (k — 1, l)-orientations worked for k < 3, 
and were only shown to have expected linear running time. 

Key words. Multiple-choice hashing, random hypergraphs, orientations. 



1 Introduction 

The efficiency of many algorithms and data structures rests on the fact that randomly and inde- 
pendently throwing m balls into n bins ensures a distribution that is, with high probability, close 
to uniform. Since the seminal paper of Azar et al. [3] a large literature has grown around even 
stronger multiple- choice load balancing schemes where the location of each ball is selected within 
a random set of k > 1 bins. 

These problems have been studied both in the on-line setting, where balls and their possible 
locations are revealed one by one, and in the off-line setting where we are interested in the best 
allocation of a given set of balls. Most often, the focus of multiple-choice schemes is on minimizing 
the maximum number of balls contained in any bin. The question can also be turned around to ask 
for the largest number of balls that can be placed such that there are at most b balls in each bin. 
Of course, this number depends on the random choices made, but in the off-line setting it turns 
out that there is a well-defined threshold m = (Irk o(l))an, below which it is highly likely that the 
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allocation is possible, and above which it is highly unlikely that the allocation is possible. Here, a 
is a constant that depends on k and b, but not on n. 

In this paper we consider the scenario where each ball comes in d copies, and must be placed 
in exactly d (distinct) out of k possible bins. Observe that the case d = k is not so interesting, 
because it is equivalent to the single-choice case with md balls. Thus the interesting extreme case 
is d = k — 1, which is the focus of this paper. Motivation for copying each ball comes from parallel 
and distributed systems where we want high redundancy (resistance to d— 1 failures), and/or want 
to ensure that any set of balls can be accessed in parallel with only a single request per bin. Early 
papers investigating such schemes include [6j l22l I20j . As a more recent example, Amossen and 
Pagh [2] considered the case k = 3, d = 2, 6 = 1, and showed that up to ^-^-m balls can be placed 
with high probability^] for any constant e > 0. This was used to construct a data structure for sets 
that allows very fast computation of set intersections on graphics hardware. In this paper we show 
that the constant 6 in [2] cannot be reduced, i.e., that m = (1 ± o(l))n/6 is the threshold for the 
problem of allocating balls into 2 of 3 bins with maximum load 1. In fact, we generalize this result 
to the extreme case d = k — 1, 6 = 1 for any k > 2, giving explicit bounds on the probability of 
successful allocation in terms of m. We also present a generalization of the algorithms of [2j [18] 
that computes a (k — 1, l)-orientation (if one exists) of a given fe-regular hypergraph, and show that 
it runs in linear time with high probability. This strengthens [2] which only shows linear running 
time in expectation. 

1.1 Related work 

Multiple-choice balls and bins scenarios can be modeled as a random A:-regular hypergraph with m 
edges (balls) on n vertices (bins), where edges are chosen i.i.d. uniformly from the set of all fc-sets 
of vertices. Let H nm .k be the random /c-uniform hypergraph with n vertices and m hyperedges, 
where each such object is taken with equal probability. In the regime of interest in this work, 
when m is linear in n, there is essentially no difference between allowing and disallowing multiple 
edges, because for k > 3, the probability that the multi-hypergraph analogue repeats an edge is 
only 0(n _1 ). Given such a hypergraph, a (d,b) -orientation is an assignment of each edge to d of 
its vertices, such that no vertex has more than b edges assigned to it. 

On-line setting. In our description of the on-line setting, we restrict attention to the case where 
balls cannot be moved, once placed into bins. Azar et al. [3] considered (1, 6)-orientations in the 
on-line setting, and showed that the greedy algorithm that always assigns a ball to its least loaded 
bin achieves a (1, 0(m/n + log log mj log /c))-orientation. Tighter bounds for the maximum load of 
(1, 6)-orientations were later obtained by Berenbrink et al. [4]. 

Off-line setting. In the off-line setting, the threshold for (1, l)-orientations with k = 2 can 
be shown (see, e.g. [18J) to coincide with the appearance of a giant component in the random 
graph, which is known to happen at m = (1 ± o(l))n/2 with high probability [8]. Several groups 
of researchers [71 111} [T2] independently established the thresholds for (1, l)-orientations for every 
k > 2. Generalizing in another direction, Fernholz and Ramachandran [9] and Cain, Sanders, and 

1 Meaning probability tending to 1 as n — >• oo. 
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Wormald [5] showed thresholds for (1, 6)-orientations for k = 2, and gave expected linear time 
algorithms for computing an orientation. This result was later extended to k > 2 by Fountoulakis 
et al. |10| . Gao and Wormald [13] established thresholds for (d, 6)-orientations, given that b is 
a sufficiently large constant (depending on d and k). Independently of our work, Lelarge [16] 
recently developed new technical machinery for this problem, which handles all combinations of 
the parameters k, d, and b that satisfy max(/c — d,b) > 2. 

1.2 Our contribution 

In this paper we consider the remaining "extreme" case of max(fc — d,b) = 1, i.e., d = k — 1 
and 6 = 1. For this, we highlight two links between Probabilistic Combinatorics and (k — 1, 1)- 
orientations. First, we observe the connection between the literature on the phase transition in 
random hypergraphs and (k — 1, l)-orientations, which provides a natural explanation for the 
threshold phenomenon experimentally documented in [2]. Second, we derive explicit, quantita- 
tive high-probability bounds for the subcritical running time, by tracking a key parameter known 
as "susceptibility," through the Differential Equations method for analyzing discrete random pro- 
cesses. Previous bounds were only of expected-time type. Also, since we seek good polynomial-type 
dependencies in our probability bounds, we perform a more careful analysis of the susceptibility 
growth, which is substantially sharper than in previous published work (e.g., [21]) which was sat- 
isfied with error bounds that could tend to zero very slowly. Our main theorem refers to the 
pseudocode of the Orient algorithm, which can be found in section 13.11 Its running time is deter- 
mined by the number of iterations, which we define to be the number of times the condition in the 
while loop is evaluated. 

Theorem 1.1. Let < e < \ be given, and assume that ^ - > 40 °i° fc6 . Let m = (1 — e) fc( - fc "_ 1 - ) • 
With probability at least 1 — 3n~ x , all edges of the random k-uniform hypergraph H n ^ m .^ can be 
(k — 1, l)-oriented by the Orient procedure using a total of at most 



iterations, each taking constant time. 

This paper is organized as follows. The next section observes the natural threshold for extreme 
orientability. Then, Section 3 applies the Differential Equations method to deduce quantitative 
high-probability bounds for algorithmic performance in the feasible regime. The following (stan- 
dard) asymptotic notation will be utilized extensively. For two functions f(n) and g(n), we write 
f(n) = o(g(n)) or g(n) = w(/(n)) if lim^oo f(n)/g(n) = 0, and f(n) = 0(g{n)) or g(n) = ^(/(n)) 
if there exists a constant M such that |/(n)| < M\g(n)\ for all sufficiently large n. 

2 Non-orientability 

We now investigate why there is no (k — 1, fc)-orientation when the number of edges exceeds ^zry • 
This is done by exhibiting an obstruction that appears asymptotically almost surely as n approaches 
infinity. One may observe many types of possible obstructions to orientability. A simple example 




3 



for k > 3 is the /c-uniform hypergraph consisting of two hyperedges overlapping in three vertices. 
It is clearly impossible to pick k — 1 vertices for each hyperedge, as there are only 2k — 3 vertices 
to share. Unfortunately, any fixed -size obstruction has a threshold for appearance in Hn,m;k that 
is far beyond k (k-i) ' so one cann °t simply pinpoint a single such hypergraph as the culprit for 
non-orientability. 

Instead, we draw inspiration from the case k = 2 (often referred to as "cuckoo hashing" [18] ) 
where the desired threshold S matches the appearance of the well-studied giant component. Indeed, 
the seminal result of Erdos and Renyi [8] established that in the uniformly random graph with cn 
edges, for constants c < |, the largest connected component has size O(logn), whereas for constants 
c > ^, the largest connected component has size f2(n). Further study (see, e.g., the book [H] ) 
revealed that for c < i, all connected components are either trees or unicyclic (containing at most 
one cycle), whereas for c > ^, the giant component is multicyclic. As any multicyclic component 
would have too many edges for vertices to be (k — 1, l)-orientable, this would establish the result 
for k = 2. 

The remainder of this section translates the random graph literature into the orientability 
context, to observe the threshold for k > 3. First, it is convenient to introduce a measure of how 
"crowded" a component is. 

Definition 2.1. Let k > 3 be a fixed integer, and let H be a k-uniform hypergraph. The excess 
of H is the difference (k — l)e(H) — v(H), where v(H) and e(H) denote the numbers of vertices 
and edges in H , respectively. 

For connected hypergraphs H, the excess is always an integer greater than or equal to — 1. 
When it is —1, the hypergraph is acyclic, and called a hypertree. When the excess is 0, we say that 
H is unicyclic, and when the excess is positive, we say that H is complex. Note that in the context 
of (k — 1, l)-orientability, any complex component is an obstruction. Given an edge set E' we define 
its capacity as cap(£") = X^ey min(&, |{e G E' : v G e}|). We have the following consequence of 
the max-flow min-cut theorem (see, e.g. [191 Section 6.1]): 

Theorem 2.2. A k-regular hypergraph (V,E) has a (d,b)- orientation if and only if each subset 
E' C E has capacity cap(E') > \E'\d. 

Proof. The capacity sums, over each vertex, an upper bound on how many edges in E' can be 
oriented towards it. If some edge set E' has capacity less than \E'\d it is thus impossible to orient 
all its edges (even ignoring edges outside of E'). For the reverse direction consider the flow network 
with: 

• Node set E U V U {s, t}, i.e., a node per edge and vertex in (V, E), plus a source node s, and 
a sink node t. 

• Capacity 1 edges connecting the node of each e G E to the k nodes in V contained in e. 

• Capacity d edges from s to each vertex in E, and capacity b edges from vertex in V to t. 

Observe that an integer s-t flow corresponds to an orientation of edges with a flow of 1 from an edge 
to each vertex that the edge is oriented towards. This means that if there is no (d, 6)-orientation, 
there is no integer s-t flow of value \E\d. Since all capacities in the network are integral, this in 
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turn means that there exists no flow of value \E\d at all. Using the max- flow min-cut theorem this 
implies that there is a minimum s-t cut (S, T) such that the total capacity of edges from S to T is 
cut(5, T) < \E\d. Let E' denote the set of edges that are members of S. Since (S,T) is minimal 
vertices in V n S appear in at least b edges in E', and vertices in V n T appear in at most b edges 
of E 1 . Thus we obtain: 

cap(E') = b + Yl \{ e £ E' : v £ e}|) = cut(5, T) — \E\E'\d < \E'\d . 

□ 

For b = 1 the capacity of a set E 1 is exactly the number of distinct vertices in its edges, so 
the capacity of E' is (k — 1)\E'\ minus the excess of E' . This means that (k — 1, l)-orientability 
exactly coincides with the appearance of a complex component. Much is known about the phase 
transition in random hypergraphs. The following results are from the paper [15] of Karohski and 
Luczak, which actually determines several results of much higher precision. 

Theorem 2.3. (Theorem 4 in fl&j-) Let k > 3 be a fixed integer, and let m = k (£—i) ~ t( n )> 
where t(n) is any function of higher order than n 2 / 3 , i.e., t(n) = w(n 2 / 3 ). Then iT n ,m\h consists of 
hypertrees and unicyclic components with high probability (as n grows). 

Remark. The previous argument establishes that hypertrees and unicyclic components can be 
(k — 1, l)-oriented, although the running time for computing the orientation may increase with the 
component size. The earlier result of the second author established that in expectation, this could 
be done efficiently for m = (1 — e) k (f?-i) m ^ ne case k = 3. The connection above complements 
the earlier result by establishing feasibility, although not necessarily efficiency, when the number of 
edges differs from k ^_^ by a sublinear term. 

Theorem 2.4. (Theorem 10 in }15f.) Let k > 3 be a fixed integer, and let m = jm^rn + t{n), 

where t(n) is any function of higher order than ra 2 / 3 but smaller order than re2 ^ 3 ( i ^ > iog W ) 1 ^ 3 - Then 
with high probability, H n ^ m -k consists of one large complex component and some number of small 
components which are either hypertrees or unicyclic. 

Remark. Clearly, adding more edges only creates more complex components, so the upper bound 
on t(n) plays a role only in limiting the number of large complex components. 

Therefore, as soon as we exceed by even a sublinear deviation, an obstruction appears, and 

hence (k — 1, l)-orientability fails. Note that we cannot bound the size of the complex component, 
and in fact its size grows with n. There remains a window of width roughly n 2//3 between the lower 
and upper bounds. It is worth noting that for the case of graphs, this is also well-understood, and 
when m = m^rn + cr ^^ f° r (positive or negative) constants c, there is a constant probability of 
having a complex component. See, e.g., the discussion in the book [I]. 

3 High-probability running time bound 

In this section we present and analyze a simple algorithm for finding {k — 1, /c)-orientations. The 
first step is to observe that the running time to orient each new edge is 0(k 2 s), where s is the size 
of the connected component formed by the new edge. 



5 



3.1 Algorithm description 

The algorithm works by iteratively extending an orientation to more and more edges. We assume 
that vertices of each edge e can be traversed using methods e. first () (which returns an arbitrary 
vertex) and e.next(f) (which gives the next node in the order after v, cycling back to e. first () when 
all vertices have been traversed). For v E V let T[v] refer to the edge that is oriented towards v, 
where T[v] = _L if no edge is oriented towards v. We maintain an array indexed by V that initially 
has all entries set to _L. An edge e is directed to k — 1 vertices by calling the following procedure, 
generalizing the procedure of [2]. We use the notation -H- to indicate exchange of two variable 
values. 

procedure ORiENT(e) 
for i := 1 to k — 1 do 

r = e 

v = e. first () 
while t/1 

v = r.next(t> ) 

r O T[v) 
end while 
end for 

When Orient is called, each member of the set of previously oriented edges E\ appears k — 1 
times in T . The procedure runs a while loop k — 1 times that (if it terminates) inserts e in T[v] 
for some v € e, while ensuring that each edge e' G i?i is still oriented towards k — 1 positions in T. 
The invariant of the while loop is that all edges in E\ are oriented towards k — 1 vertices, and e is 
oriented towards i vertices, with one exception: If r 7^ _L the edge r which is oriented towards one 
vertex less. Clearly, once i = k — 1 and r = _L we have oriented all edges in E\ U {e}. 

We claim that the procedure always terminates if an orientation exists, and more specifically 
that the time spent if e is in a component of size s is 0(k 2 s). (Some stopping criterion is needed 
for termination in case no orientation exists, but this is left out for simplicity.) Suppose the while 
loop does not stop, i.e., it goes through an infinite sequence of edges. Let ex, e2, e^, . . . denote this 
edge sequence, with consecutive identical edges combined into a single occurrence. We observe that 
there can be at most k—1 consecutive iterations involving a particular edge. Notice also that edge 
shares at least one vertex with edge ej+i for each i. Consider a minimal subsequence Ci, . . . ,6j 
containing 3 such occurrences of some edge, and without loss of generality, assume that e = 1. Let 
t\ and £2, 1 < i\ < £2 < j, be the indexes of the edge in this subsequence that first appears for 
the second time (so £2 is minimal). Since all previously fully-oriented edges already are oriented 
towards all but one of their k vertices, one observes that then e^+t = e^-t for t = 0, . . . ,£± — 1. 
This means that the £2 — 1 distinct edges e%,... ,e^ 2 _i contain exactly (£2 — l)(k — 1) distinct 
vertices. From ei 2 to e^+^-i = ex, the edges encountered are all repeats (in reverse order) of those 
already seen. After e^+^-i (which is equal to ex) each new edge introduces at most k — 1 new 
vertices until we reach an edge that overlaps with a previously visited edge and only k — 2 vertices 
are introduced. At that point we have visited a set of edges having less than k — 1 available vertices 
on average, meaning that no (k — 1, fc)-orientation exists. 

Therefore, the length of the edge sequence is at most 2s, while the number of consecutive 
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identical edges consolidated into each element is at most k — 1. Since the while loop is run k — 1 
times for each new edge to orient, we conclude that the full orientation of the edge completes in 
0(k 2 s) time. 



3.2 Probabilistic tools 

We will need the following version of the Chernoff bound (see e.g. [17J): 

Theorem 3.1. For any < e < 1, every binomial random variable X with mean \i satisfies 

2 2 

P[X < (1-e)//] < e - V** and P [X > (1 + e)n] < e~ . 

A supermartingale is a sequence Xq,Xi, ... of random variables such that each conditional ex- 
pectation E [Xt+i | Xq, . . . , X t ] is at most X t . Azuma's inequality (see e.g. [17] ) states the following: 

Theorem 3.2. Let Xq, . . . ,X n be a supermartingale, with bounded differences \Xt+i — Xt\ < C. 
Then for any A > 0, 

r ^2 

[X n >X + A] < exp 



2C 2 



n 

3.3 Analysis of random hypergraphs 

Throughout, we impose explicit bounds that keep n "sufficiently large" in order to simplify our cal- 
culations. Recall that H nm -h is the random /c-uniform hypergraph obtained by uniformly sampling 
one with n-vertices and m hyperedges. In this section, it will be substantially more convenient for 
us to work with a process that exhibits more independence. Specifically, we consider instead the 
following sequential process, which fortunately is quite similar to the original H nm -k,. 

Lemma 3.3. Let n > k > 2, with n > 2000. Consider the random hypergraph process Hq, Hi, . . ., 
where Hq is the empty hypergraph with n isolated vertices. At each time t, sample k vertices 
independently and uniformly at random. If they are distinct, and form a hyperedge which does not 
yet appear in H t , then add it to form H t +\. Otherwise, let H t +\ = H t . Then, with probability at 
least 1 — n~ l , in the first k ^_^ rounds, the number of times that we do not add is at most logn. 

Proof. At time t + 1, a union bound shows that the probability that the k sampled vertices are 
not distinct is at most 

1 2 k-1 k(k-l) 
- + - + = -^- '-. 

n n n An 

Also, since t < m^rn ? the probability that the k sampled vertices form a previously-added hyper- 
edge is 

tkl (k - 2)1 1 
— < — < — 

where we used n > k for the final bound. Thus the probability that Ht+i = Ht is at most k ^ k ^ , 
and so the probability that this happens at least s = logn times in the first jm^rn rounds is at 
most 

^\ (k(k-m s (e-k^Y fk(k-l)Y _ fey _ ( e \ lo * n 



s J \ n ) \ s I V n ) \s) \logn 
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which is below n 1 for all n > e e2 . □ 

It is sometimes more convenient to work with the related model H n ,p;ki which is the random 
A;-uniform hypergraph formed by taking each of the (?) potential hyperedges independently with 
probability p. Fortunately, the behavior of H n p -k closely approximates that of H n rn .^. 

Lemma 3.4. Assume that < e < \, k > 2, and > 10 9j fc 2 . Let m = (1 — e) k ^-i) > an< ^ 

p = (1 — 0.8e) ^--i' ■ Then we may couple the probability spaces such that H n>m -k is contained in 
Hn,p;k with probability at least 1 — n^ 1 . 

Proof. Couple the probability spaces with the random hypergraph process, which corresponds to 
a uniformly random permutation of all (?) potential edges. Then, H n ^ m -^ corresponds to forming 
a hypergraph with precisely the first m edges in this permutation, and H n>p -k corresponds to 
generating an independent random variable X ~ Bin [()!), p] , and then taking the first X edges in 
the permutation. 

Therefore, it suffices to show that X > (1 — e) jm^n with high probability. We calculate 

fn\ , (n-k) k 



k) F - y ' 'k(k - l)n k ~ l ' 

_ n «t A 

fc(fc-l) ' 



Next, observe that if { n ^ k ) k > l-0.01e, then we will have E [X] > (1 - O.Sle)^^. The Chernoff 
bound (Theorem 13. ip would then give 



X < (1 



< e 



(0.19e) 2 n ns1 x n 

2 [l-0-»leJ fc(fc „i) 



k(k-l) 

Using e < ^, and n > 10 ^ fc logra, we conclude that this probability is at most re -1 ' 07 . It remains to 



show that > 1 — y5q. After rearrangement, we see that this is equivalent to 

n> (1) 

1 I 1 100/ 



However, e x < 1 — | for all < x < 1, so 



< e ioofc < 1 



V 100/ ~ 200/fc 

This, together with our assumption that n > 10 ° 2 fc2 log n > 20 ° fc2 , produces (pQ). □ 

Lemma 3.5. Lei < e < ^ and n > 2 °o fc2 . iei p = (1 — 0.8e) ^rjf ■ /n i/ie random hypergraph 
Hnp;k, with probability at least 1 — n™ 1 , aZZ connected components are of size at most ^^logn. 



Proof. Let V be the vertex set of the entire hypergraph. Let v be a fixed vertex, and let the 
random variable X v be the size of the connected component containing v. We generate X v by 
exposing hyperedges one at a time via breadth-first-search. Specifically, we maintain time-varying 
sets At of distinct active vertices and Bt of completed vertices, and build a labeling of the vertices 
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vi, V2, ■ ■ ■ , v n , initializing Aq = {v} and Bq = 0. At time t, we arbitrarily select a vertex w £ At (if 
At is empty, we stop), define the label vt = w, and set At+\ = At\ {w } and Bt+i = Bt U {tw}. Also, 
we expose all hyperedges which have exactly k — 1 vertices in V \ {t>i, . . . , «t-i}, together with w 
as the k-th vertex. Here, "expose" means that we reveal whether or not the potential hyperedge 
in fact appears in this particular realization of H n ^. Finally, for each vertex other than w which 
is in a newly exposed hyperedge, we add a single copy of that vertex to A t +\. 

Importantly, we never expose the same hyperedge twice, because the hyperedges exposed at 
time t have the property that their smallest labeled vertex is precisely vt- Therefore, the decisions 
are independent at each stage, and the number of vertices added to the queue is stochastically 
dominated by (k — 1) times the Binomial random variable Bin . In particular, if we 

define the random variables Y t = \A t \, then each successive difference Y t+ \ — Y t is stochastically 
dominated by (k — l)Bin ( fc ™ 1 ),p — 1. Therefore, if we define the infinite sequence Z t as Zq = 1, 

Z t+ i = Z t + (k — l)Bin ,p — 1, we may couple the probability spaces such that Y t < Z t until 

Y t hits (the breadth-first-search is exhausted). 

log n. Since a binomial random variable is the sum of independent Bernoullis, the 



Let T 



16fc 



sum of independent and identically distributed binomials is another binomial. Thus the distribution 
of Zt is precisely 1 + (k — l)Bin ( fc " 1 )T,p — T. The Chernoff bound will control the probability 
that Zt > 1, and this will be sufficient because if the integer Zt < 1, then the breadth-first- 
search must have completed, as Y t < Z t during it. Observe that Zt > 1 happens precisely when 



Bin 



(*-i)r,P 



Yet the expectation of this binomial is 



n 
k-l 



(k — 2V T 
r(l-0.8e)^ E=i ^<(l-0.8e), 



n" x k — 1 

so when Zt > 1, that binomial exceeds its expectation /ibya factor of at least 0.8e. Hence 

(0.8t) 2 

F [Z T > 1] < e s ^ . 

To continue, we need a lower bound on fi. At the end of the proof of the previous lemma, we 
showed that n > ^! implies that {^f > 1 - O.Ole. Since {^f' 1 > {^f \ and we assume 
e < i, we therefore have that 



"=G-i) r(i - £) ^ £<i - a8i£, ^T 2a 

Thus using T = ^ log n, we have 



595 



T 



k-l 



P [Z T > 1] < e 



.(°M1. . 595 .T _ 2 
3 k-l < n 



16fe 



logn. 



i.e., a fixed vertex v has probability at least 1 — n 2 of having its component size at most 
A final union bound over the n vertices yields the desired result. □ 

We now move to introduce the key parameter which characterizes the overall running time 
of our algorithm. This parameter has been successfully used to analyze various discrete random 
processes, ranging from percolation (where its name originated from statistical physics) to the 
theory of random graphs and stochastic coalescence processes. 
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Definition 3.6. The susceptibility of a given hypergraph H, denoted by x(H)> ^ s the expected 
size of the component which contains a uniformly random vertex. Equivalently, if the connected 
components are C\, C2, • • • , Ct, then x = r Yli | | 2 - 

It turns out that the susceptibility evolves smoothly with the number of edges m. The following 
theorem applies the Differential Equations method to estimate its growth. Its analysis builds upon 
the approach used in [21] , but improves the error bounds from exponential to polynomial (in both 
- and n). 

Theorem 3.7. Let < e < \ be given, and assume that > 40 °?° fc6 ■ Let m = (1 — e) k ^_^ ■ 

With probability at least 1 — 3W 1 , the random k-uniform hypergraph H n ^ m .^ has susceptibility at 
most 

1 200A; 3 log 3 n 

Proof. Define 

Consider the random hypergraph process Hq, Hi, . . . of Lemma 13.31 We will run this process 
to time T + logn which by Lemma 13.31 will contain H n ^ m -^ with probability at least 1 — re -1 , 
because logn < e • w^Li) • ^ therefore suffices to show that with probability at least 1 — 2n _1 , the 
susceptibility of Hx+iogn is at most ([2]). We track the evolution of susceptibility by defining X t to 
be the susceptibility of H t . Suppose that in the (t + l)-st round, the k vertices of the incoming 
hyperedge lie in components C\, . . . , C^, where some of the components may be repeated. Then, 
the susceptibility increases by at most 

I [ddi + . . . + \c k \) 2 - (|d| 2 + • • • + |c fc | 2 )] = - V |a.||c s | , 

r<s 

with equality only if C\, . . . , are distinct. Define the filtration To, T\, . . . such that Tt captures 
the outcomes up to and including time t. Let us bound E [Xt+i — X t \ T\. For this, let c\,...,c z 
be the component sizes after time t. Since our process selects k independent vertices for the next 
hyperedge, and the hyperedge is added only if they are distinct from each other, and form a new 
hyperedge, we then have 



E[X t+1 -X t \T t ]< V (2L. I*. ..!&). 1 V 



n / n 

Kr<s<k 
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This suggests that the evolution of X t may resemble that of the differential equation x'(9) = 
k{k — l)x(9) 2 , where we parameterize 8 = j. So, let us define 



x{9) 



l-fc(fc-l)0' 



which is the solution of that differential equation with initial condition x(0) = 1. We use x{8) to 
convert (X t ) into a supermartingale, defining 



Y t = X t -x[-)-f[-)A, 



n 



n 



where 



/(*) = 



1 



199 A: 3 log £ 



n 



{\-k{k-\)8f ' 
is the solution to the differential equation 



f'(9)=3k(k-l).x(9)f(9); /(0) = 1 . 

Also, define E t to be the event that (i) X t < x(^) + /(^)A and (ii) all components of H t have 
size at most ^ logn. Then, define the stopping time r to be either T, or the first moment that 
E t fails. Define 

Zt = Ymm{t,r} ■ 

Then, 



E[Z, 



t+l — *t I ->U &t 







and 



E[Z m -Z,| ^ )jE ; t ]<Mi_i) x 2 



x 



t + l 



n 



t 

x | — 

n 



t + l 



n 



A. 



which by convexity of x{8) and /(#) is at most 

E [Z m - Z t | Ti, £ t ] < fci) x t 2 - ix' f * ) - V f - ) A 

n n \n I n \n , 



< 



k(k-l) 
n 

k(k - 1) 

n 



77. 



* - +/ r A 



1 2 



2x 



77 



1^(1) -I, (I, A 

n \ n / n \ n . 



n 



A' 



n \n 



A. 



Our condition on n is essentially equivalent to the fact that A < e 2 . This implies that over the 
range < 9 < ^pErq > we always have 



f(9)A<x(0), 



(3) 



so 



E[Z m -Z t | Ji,^] < - 

77 



77 



3*(*-l)-s - / - -/' - 



77 



77 



o, 
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and we conclude that Zt is in fact a supermartingale. Furthermore, by part (ii) of the definition 
of Et, we know that the addition of a single hyperedge cannot increase the susceptibility by more 
than 



1 

n 



16k \ 2 (16k. n2 

K ■ — ^ n J ~ \ — 2~ S n 



^ 256/c 4 log 2 n 



e 4 n 



Since x(9) and f(9) are both increasing functions, this is an upper bound for the incremental change 
Zt+i — Z t . On the other hand, the susceptibility can never decrease, and on the range 6 < fcpjE~n > 
the derivatives x'(9) and f'(0) increase to k ^ 1 ^ and 3fc ( fc ~ 1 ) ; respectively. 

Since x(6) and f(9) are convex, we conclude that as t ranges from to T, the maximum one-step 
change in Zt is bounded by 

f256fc 4 log 2 n k(k-l) 1 3k(k - 1) A] 256/c 4 log 2 ra 
C = max <^ -. , k • - + 



4 ' 2 1 4 14 



Yet Zq = —A, so Hoeffding-Azuma (Theorem I3.2[) implies that 

f A 2 ] f 4nlog 2 n] , 
P [Zt > 0] < exp -— =- < exp * < 



Also, by Lemma 13.51 the probability that -ffy has a component with size exceeding log n is at 
most n . Hence with probability at least 1 — 2n _1 , we have that both Zt < and all components 
of Ht have size at most i^logra. When this happens, we must never have had any E t fail, and 
hence we conclude that Yt = Zt < 0, and so the susceptibility after T rounds is 



T\ /T\ . /l-e\ / 1-e \ . 1 1 199/t 3 log 



X T <x - +/ - A = x — +/ — A = - + ^ 



„3 



n 



n) \n J \k(k — 1) J \k(k — 1) J e e 3 e^^Jn 

Adding logn more rounds to reach time T + logra, we see that these can link at most klogn 
clusters, and since we conditioned on all clusters having size at most log n, this can further 
increase the susceptibility by at most 



1 / 16k \ 2 256/s 4 log 4 n k 3 lo^ 3 



- • [k log n • — T log n) = < — = 



by our initial assumption on the size of n. Therefore, with ^probability at least 1 — 2n , the total 
susceptibility after T + logn rounds is at most | + 200 ^ 7 ^| n , as required. □ 

We now combine all of our results to produce our main theorem, which provides a single high- 
probability bound for the final sum of squared component sizes in H n)m -k- 

Proof of Theorem 11.11 As explained in section 13.11 the time for inserting a key in a component 
of size s is 0(k 2 s). This means that if the final hypergraph contains a component of size s, it took 
only 0(Yli=i k 2 s) time to insert all edges of that component, i.e., 0(k 2 s 2 ) operations. Each edge 
is in exactly one component, and we recognize that summing the squares of the final component 
sizes gives exactly n times the final susceptibility. Thus, we can bound the total running time by 
0(k 2 n) times the final susceptibility, which by Theorem 13.71 is bounded by a constant with high 
probability. □ 
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